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DETECTION OF CYP3A4 AND CYP2C9 POLYMORPHISMS 

The present invention is directed to methods of preparing biological samples for 
nucleic acid analysis using oligonucleotide primers suitable for amplification of the genes 
5 encoding the drug-metabolizing cytochrome P450 enzymes CYP3A4 and CYP2C19. 

BACKGROUND OF THE INVENTION 
Xenobiotics are pharmacologically, endocrinological^, or toxicologically active 
substances foreign to a biological system. Most xenobiotics, including pharmaceutical 
agents, are metabolized through two successive reactions. Phase I reactions 

10 (functionalization reactions), include oxidation, reduction, and hydrolysis, in which a 
derivatizable group is added to the original molecule. Functionalization prepares the 
drug for further metabolism in phase II reactions. During phase II reactions (conjugative 
reactions, which include glucoronidation, sulfation, methylation and acetylation), the 
functionalized drug is conjugated with a hydrophilic group. The resulting hydrophilic 

15 compounds are inactive and excreted in bile or urine. Thus, metabolism can result in 
detoxification and excretion of the active substance. Alternatively, an inert xenobiotic 
may be metabolized to an active compound. For example, a pro-drug may be converted 
to a biologically active therapeutic or toxin. 

The cytochrome P450 (CYP) enzymes are involved in the metabolism of many 

20 different xenobiotics. CYPs are a superfamily of heme-containing enzymes, found in 
eukaryotes (both plants and animals) and prokaryotes, and are responsible for Phase I 
reactions in the metabolic process. In total, over 500 genes belonging to the CYP 
superfamily have been described and divided into subfamilies, CYP1-CYP27. In 
humans, more than 35 genes and 7 pseudogenes have been identified. Members of three 

25 CYP gene families, CYP1, CYP2, and CYPS, are responsible for the majority of drug 
metabolism. The human CYPs which are of greatest clinical relevance for the 
metabolism of drugs and other xenobiotics are CYP1 A2, CYP2A6, CYP2C9, CYP2C19, 
CYP2D6, CYP2E1 and CYP3A4. The liver is the major site of activity of these 
enzymes, however CYPs are also expressed in other tissues. 
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The most important drug-metabolizing CYP enzyme is CYP3A4, which is the 
major CYP expressed in liver. Expression of the gene encoding CYP3A4 (CYP3A4) is 
inducible by many commonly used drugs, such as dexamethasone, rifampicin, and 
clotrimazole. CYP3A4 is estimated to metabolize more than 60% of all drugs in clinical 
5 use, including calcium channel blockers such as nifedipine, immunosuppressants such as 
cyclosporin A, macrolide antibiotics such as erythromycin, and steroid hormones. In 
addition, CYP3 A4 metabolizes some carcinogens, and may be implicated in an 
individual's susceptibility to such toxins. 

The existence of more than one form of the CYP3A4 enzyme is caused by 

10 polymorphisms in the gene which encodes the CYP3A4 enzyme (the gene being denoted 
in italics, as CYP3A4). In fact, almost 20 polymorphisms in the CYP3A4 gene have been 
described (see http://www.imm.ki.se/cvpalleles/ for listing). The distribution of 
particular CYP3A4 polymorphisms differs among ethnic groups, however, concomitant 
differences in CYP3 A4 activity and responses to drugs which are CYP3 A4 substrates 

15 remain to be investigated. CYP3A4*1A is the wild type gene, corresponding to the cDNA 
having GenBank Accession No. A18907 and the genomic DNA having GenBank 
Accession No. AF280107. A number of mutations in the 5' untranslated region of 
CYP3A4 have been described. CYP3A4HB is an A to G substitution at position -392. 
CYP3A4*1C is a T to G substitution at position -444. CYP 3 A4* ID is a C to A 

20 substitution at position -62. CYP 3 A4* IE is a T to A substitution at position -369. 

CYP3A4*1F is a C to G substitution at -747. The 5' flanking region of CYP3A4 is set 
forth in SEQ ID NO:l and in Figure 1. 

WO 01/20025 discloses single nucleotide polymorphisms in various exons, 
introns, and in the 3' UTR of CYP3A4, as well as oligonucleotides for use in diagnosing 

25 and treating abnormal expression and/or function of this gene. WO 00/24926 discloses 
oligonucleotides for use in detecting an A to G point mutation at position -290 of 
CYP3A4. WO 99/13106 discloses polymorphisms in CYP3A4, including an A to G 
substitution at position -392 of the promoter, at the 7 th position of the 10 bp NFSE, within 
oligonucleotides having sequences ACAAGGGCAAGAGAGAGGC (SEQ ID NO:2) 
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and ACAAGGGCAGGAGAGAGGC (SEQ ID NO:3), with polymorphic variants 

indicated in bold type. 

U.S.Pat.No. 6,174,684 and corresponding WO 00/09752 disclose an A to G 

variant in the nifedipine-specific regulatory element located at positions -287 to -296 of 
5 CYP3A4, which is associated with increased risk of prostate cancer and with increased 

risk of developing leukemia after administration of an epipodophyllotoxin. U.S.Pat.No. 

6,174,684 also discloses the oligonucleotides AGGGCAAGAG (SEQ ID NO:4) and 

AGGGCAGGAG (SEQ ID NO:5), with polymorphic variants indicated in bold type. 

Rebbeck, et ah (1998) J. Natl. Cancer Inst. 90, 1225-1229 also describes this association 
10 between prostate cancer, leukemia, and the A to G mutation. 

Kuehl, et al. (2001) Nature Genetics 27, 383-391 discloses mutations at positions 

-341, -288, and -43 of the CYP3A4 promoter, none of which were associated with altered 

CYP3 A4 activity. Kuehl, et al. also discloses differential distribution of these 

polymorphisms among Caucasians and African Americans. 
15 A second important CYP enzyme is CYP2C9, which is active in hydroxylation of 

such drugs as tolbutamide, phenytoin, S-warfarin, diclofenac, ibuprofen, and losarten. 

The sequence of CYP2C9 is set forth in SEQ ID NO:6. Six variants in CYP2C9 are 

described on the CYP web site, and another six variant designations are listed without 

descriptions. The CYP2C9*1 variant is designated as the wild type. Four of the five 
20 polymorphic CYP2C9 forms described contain mutations in the coding regions of the 

gene that result in decreased in vitro activity, and the remaining variant, CYP2C9*6, is a 

deletion of an A at position 818 which results in a frame shift. 

WO00/12757 discloses primer extension assays and kits for detection of the 

single nucleotide polymorphisms CYP2C9*2 and CYP2C9*3 9 both of which result in 
25 amino acid substitutions. 

On the basis of ability of metabolize a marker drug such as nifedipine for 

CYP3 A4 or S-warfarin for CYP2C9, individuals may be characterized as poor 

metabolizers (PM), intermediate metabolizers (IM), extensive metabolizers (EM) or ultra 

extensive metabolizers (UEM or UM) for CYP3A4 or CYP2C9 substrates, respectively. 
30 Poor metabolizers retain the substrate in their bodies for a relatively long period of time, 



3 



WO 02/18641 



PCT/IB01/01580 



and are susceptible to toxicity and side effects at "normal" dosages. Ultraextensive 
metabolizers clear the substrate from their bodies quickly, and require higher than 
"normal" dosages to achieve a therapeutic effect. Intermediate and extensive 
metabolizers retain the substrate in their bodies for times between those of PMs and 
5 UEMs, and are more likely to respond to "normal" dosages of the drug. However, 
individuals characterized as IM or EM may differ in drug clearance by as much as 10- 
fold, and variations in toxicity, side effects, and efficacy for a particular drug may occur 
among these individuals. However, administration of such drugs to determine an 
individual's metabolic capacity may in itself be dangerous, exposing the individual to 
10 potential toxic side effects. 

A need remains for methods of preparing biological samples that contain the 5 1 
flanking regions of CYP3A4 or CYP2C9, so that this information may be used to predict 
differential capacities for metabolizing CYP3 A4 and CYP2C9 substrates among 
individuals. 
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SUMMARY OF THE INVENTION 

The present inventors have discovered a novel single nucleotide polymorphism in 
the 5' flanking region of CYP3A4, and six novel polymorphisms in the 5 'flanking region 
of CYP2C9, Oligonucleotides have been devised for amplification of the polymorphic 
5 regions corresponding to these polymorphisms. These oligonucleotides may be used to 
prepare biological samples for further analysis of the 5' flanking regions of these genes. 
The inventors have also devised sequence determination oligonucleotides for use as 
probes for the novel single nucleotide polymorphisms in CYP3A4 and CYP2C9. 

In one embodiment, the invention provides an oligonucleotide primer pair suitable 
10 for amplifying a polymorphic region of a 5' flanking region of a CYP3A4 gene, wherein 
the polymorphic region corresponds to position 461 of SEQ ID NO:l, which position 
may also be described as position -644 from the transcription start site of the CYP3A4 
gene. 

In another embodiment, the invention provides a sequence determination 

15 oligonucleotide for detecting a polymorphic site in a 5' flanking region of a CYP3A4 
gene, said oligonucleotide being complementary to the polymorphic region 
corresponding to position 461 of SEQ ID NO:l. 

In another embodiment, the invention provides a kit for amplification and/or 
detection of a polymorphic region of the 5' flanking region of a CYP3A4 gene, said kit 

20 comprising at least one oligonucleotide primer pair capable of amplifying the region 
corresponding to position 46 1 of SEQ ID NO: 1 . 

In another embodiment, the invention provides an oligonucleotide primer pair 
suitable for amplifying a polymorphic region of a 5 'flanking region of a CYP2C9 gene, 
wherein the polymorphic region corresponds to position 957 of SEQ ID NO:6; position 

25 1049 of SEQ ID NO:6; position 1 164 of SEQ ID NO:6; position 1526 of SEQ ID NO:6; 
position 1661 of SEQ ID NO:6; and position 1662 of SEQ ID NO:6. Position 957 of 
SEQ ID NO:6 may also be described as position -1189 from the transcription start site of 
the CYP3C9 gene; position 1049 of SEQ ID NO:6 may also be described as position 
-1097 from the transcription start site; position 1164 of SEQ ID NO:6 may also be 

30 described as position -982 from the transcription start site; position 1526 of SEQ ID 
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NO:6 may also be described as position -620 from the transcription start site; position 
1661 of SEQ ID NO:6 may also be described as position -485 from the transcription start 
site; and position 1662 of SEQ ID NO:6 may also be described as position -484 from the 
transcription start site. 

5 In yet another embodiment, the invention provides a sequence determination 

oligonucleotide for detecting a polymorphic site in a 5' flanking region of a CYP2C9 
gene, said oligonucleotide comprising a sequence selected from the group consisting of 
an oligonucleotide complementary to the polymorphic region corresponding to position 
957 of SEQ ID NO:6; an oligonucleotide complementary to the polymorphic region 

10 corresponding to position 1049 of SEQ ID NO:6; an oligonucleotide complementary to 
the polymorphic region corresponding to position 1164 of SEQ ID NO:6; an 
oligonucleotide complementary to the polymorphic region corresponding to position 
1526 of SEQ ID NO:6; an oligonucleotide complementary to the polymorphic region 
corresponding to position 1661 of SEQ ID NO:6; and an oligonucleotide complementary 

15 to the polymorphic region corresponding to position 1662 of SEQ ID NO:6. 

In another embodiment, the invention provides a kit for amplification and/or 
detection of a polymorphic region corresponding to at least one polymorphic region in 
the 5' flanking region of the CYP2C9 gene, said region being selected from the group 
consisting of position 957 of SEQ ID NO:6; position 1049 of SEQ ID NO:6; position 

20 1 164 of SEQ ID NO:6; position 1526 of SEQ ID NO:6; position 1661 of SEQ ID NO:6; 
and position 1662 of SEQ ID NO:6. 



BRIEF DESCRIPTION OF THE DRAWINGS 

25 Figure 1 shows the sequence of the 5' flanking region of the CYP3A4 gene as set 

forth in SEQ ID NO:l, with the novel polymorphic site underlined and highlighted in 
bold. 

Figure 2 shows the sequence of the 5 'flanking region of the CYP2C9 gene as set 
forth in SEQ ID NO:6, with the novel polymorphic sites underlined and highlighted in 
30 bold. 
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DETAILED DESCRIPTION OF THE INVENTION 

The U.S. patents and publications referenced herein are hereby incorporated by 
reference. 

For the purposes of the invention, certain terms are defined as follows. 
5 "Gene" is defined as the genomic sequence of the CYP2C19 gene. 

"Oligonucleotide" means a nucleic acid molecule preferably comprising from 
about 8 to about 50 covalently linked nucleotides. More preferably, an oligonucleotide of 
the invention comprises from about 8 to about 35 nucleotides. Most preferably, an 
oligonucleotide of the invention comprises from about 10 to about 25 nucleotides. In 

10 accordance with the invention, the nucleotides within an oligonucleotide may be analogs 
or derivatives of naturally occurring nucleotides, so long as oligonucleotides containing 
such analogs or derivatives retain the ability to hybridize specifically within the 
polymorphic region containing the targeted polymorphism. Analogs and derivatives of 
naturally occurring oligonucleotides within the scope of the present invention are 

15 exemplified in U.S. Pat. Nos. 4,469,863; 5,536,821; 5,541,306; 5,637,683; 5,637,684; 

5,700,922; 5,717,083; 5,719,262; 5,739,308; 5,773,601; 5,886,165; 5,929,226; 5,977,296; 
6,140,482; WO 00/56746; WO 01/14398, and the like. Methods for synthesizing 
oligonucleotides comprising such analogs or derivatives are disclosed, for example, in the 
patent publications cited above and in U.S. Pat. Nos. 5,614,622; 5,739,314; 5,955,599; 

20 5,962,674; 6,117,992; in WO 00/75372, and the like. The term "oligonucleotides" as 
defined herein also includes compounds which comprise the specific oligonucleotides 
disclosed herein, covalently linked to a second moiety. The second moiety may be an 
additional nucleotide sequence, for example, a tail sequence such as a polyadenosine tail 
or an adaptor sequence, for example, the phage M13 universal tail sequence, and the like. 

25 Alternatively, the second moiety may be a non-nucleotidic moiety, for example, a moiety 
which facilitates linkage to a solid support or a label to facilitate detection of the 
oligonucleotide. Such labels include, without limitation, a radioactive label, a fluorescent 
label, a chemiluminescent label, a paramagnetic label, and the like. The second moiety 
may be attached to any position of the specific oligonucleotide, so long as the 
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oligonucleotide retains its ability to hybridize to the polymorphic regions described 
herein. 

A polymorphic region as defined herein is a portion of a genetic locus that is 
characterized by at least one polymorphic site. A genetic locus is a location on a 
5 chromosome which is associated with a gene, a physical feature, or a phenotypic trait. A 
polymorphic site is a position within a genetic locus at which at least two alternative 
sequences have been observed in a population. A polymorphic region as defined herein 
is said to "correspond to" a polymorphic site, that is, the region may be adjacent to the 
polymorphic site on the 5' side of the site or on the 3' side of the site, or alternatively may 

10 contain the polymorphic site. A polymorphic region includes both the sense and 

antisense strands of the nucleic acid comprising the polymorphic site, and may have a 
length of from about 100 to about 5000 base pairs. For example, a polymorphic region 
may be all or a portion of a regulatory region such as a promoter, 5" UTR, 3' UTR, an 
intron, an exon, or the like. A polymorphic or allelic variant is a genomic DNA, cDNA, 

15 mRNA or polypeptide having a nucleotide or amino acid sequence that comprises a 

polymorphism. A polymorphism is a sequence variation observed at a polymorphic site, 
including nucleotide substitutions (single nucleotide polymorphisms or SNPs), insertions, 
deletions, and microsatellites. Polymorphisms may or may not result in detectable 
differences in gene expression, protein structure, or protein function. Preferably, a 

20 polymorphic region of the present invention has a length of about 1000 base pairs. More 
preferably, a polymorphic region of the invention has a length of about 500 base pairs. 
Most preferably, a polymorphic region of the invention has a length of about 200 base 
pairs. 

A haplotype as defined herein is a representation of the combination of 
25 polymorphic variants in a defined region within a genetic locus on one of the 

chromosomes in a chromosome pair. A genotype as used herein is a representation of the 
polymorphic variants present at a polymorphic site. 

The PCR primer pairs of the invention are capable of amplifying the polymorphic 
region corresponding to position 461 of SEQ ID NO:l, or any of the polymorphic regions 
30 corresponding to position 957 of SEQ ID NO:6; position 1049 of SEQ ID NO:6; position 
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1 164 of SEQ ID NO:6; position 1526 of SEQ ID NO:6; position 1661 of SEQ ID NO:6; 
and position 1662 of SEQ ID NO: 6. Specific oligonucleotide primer pairs of the 
invention, for amplifying position 461 of SEQ ID NO:l, may comprise sequences 
selected from the group consisting of SEQ ID NO:7 and SEQ ID NO:8; and SEQ ID 
5 NO;9 and SEQ ID NO: 10. For amplifying only position 957 of SEQ ED NO:6, an 
oligonucleotide primer pair comprising the sequences set forth in SEQ ID NO:19 and 
SEQ ID NO:20 may be used. Alternatively, positions 957 and 1049 of SEQ ID NO:6 
may be amplified using an oligonucleotide primer pair comprising the sequences set forth 
in SEQ ID NO:21 and SEQ ID NO:22; or positions 957, 1049, and 1164 may be 

10 amplified using an oligonucleotide primer pair comprising the sequences set forth in SEQ 
ID NO:23 and SEQ ID NO:24. Position 1164 of SEQ ID NO:6 may also be amplified 
using an oligonucleotide primer pair comprising the sequences set forth in SEQ ID 
NO:25 and SEQ ID NO:26. Positions 1526, 1661, and 1662 of SEQ ID NO:6 may be 
amplified using an oligonucleotide primer pair comprising the sequences set forth in SEQ 

15 ID NO:27 and SEQ ID NO:28. Positions 1661 and 1662 of SEQ ID NO:6 may be 

amplified using an oligonucleotide primer pair selected from the group consisting of an 
oligonucleotide primer pair comprising the sequences set forth in SEQ ID NO:29 and 
SEQ ID NO:30 and an oligonucleotide primer pair comprising the sequences set forth in 
SEQ ID NO:31 and SEQ ID NO:32. 

20 Each of the PGR primer pairs of the invention may be used in any PCR method. 

For example, a PCR primer pair of the invention may be used in the methods disclosed in 
U.S.PatNos. 4,683,195; 4,683,202, 4,965,188; 5,656,493; 5,998,143; 6,140,054; WO 
01/27327; WO 01/27329; and the like. The PCR pairs of the invention may also be used 
in any of the commercially available machines that perform PCR, such as any of the 

25 GeneAmp® Systems available from Applied Biosystems. 

The oligonucleotides of the invention may be used to determine the sequence of 
the polymorphic regions of SEQ ID NO: 1 or SEQ ID NO:6 as defined herein. In one 
embodiment, an oligonucleotide of the invention comprises a sequence selected from the 
group consisting of SEQ ID NO:ll; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID NO: 14; 

30 SEQ ID NO:15; SEQ ID NO:16; SEQ ID NO:17; and SEQ ID NO:18, for determining 
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the sequence of the novel polymorphic region of CYP3A4 corresponding to position 461 
of SEQ ED NO: 1. In another embodiment, for determining the sequence of the 
polymorphic region of CYP2C9 corresponding to position 957 of SEQ ID NO:6, an 
oligonucleotide of the invention comprises a sequence selected from the group consisting 
5 of SEQ ED NO:33; SEQ ID NO:34; SEQ ID NO:43; SEQ ID NO:44; SEQ ID NO:53; 
SEQ ID NO:58; SEQ ID NO:63; and SEQ ID NO:68. In another embodiment, for 
determining the sequence of the polymorphic region of CYP2C9 corresponding to 
position 1049 of SEQ ID NO:6, an oligonucleotide of the invention comprises a sequence 
selected from the group consisting of SEQ ID NO:35; SEQ ID NO:36; SEQ ID NO:45; 

10 SEQ ID NO:46; SEQ ID NO:54; SEQ ID NO:59; SEQ ID NO:64; and SEQ ED NO:69. 
In another embodiment, for determining the sequence of the polymorphic region of 
CYP2C9 corresponding to position 1164 of SEQ ID NO:6, an oligonucleotide of the 
invention comprises a sequence selected from the group consisting of SEQ ID NO: 37; 
SEQ ID NO:38; SEQ ID NO:45; SEQ ID NO:48; SEQ ED NO:55; SEQ ID NO:60; SEQ 

15 ID NO:65; and SEQ ID NO:70. In another embodiment, for determining the sequence of 
the polymorphic region of CYP2C9 corresponding to position 1526 of SEQ ED NO:6, an 
oligonucleotide of the invention comprises a sequence selected from the group consisting 
of SEQ ID NO:39; SEQ ID NO:40; SEQ ED NO:49; SEQ ED NO:50; SEQ ID NO:56; 
SEQ ID NO:61; SEQ ID NO:66; and SEQ ID NO:71. In another embodiment, for 

20 determining the sequences of the polymorphic region of CYP2C9 corresponding to either 
of positions 1661 or 1662 of SEQ ID NO:6, an oligonucleotide of the invention 
comprises a sequence selected from the group consisting of SEQ ED NO:41; SEQ ID 
NO:42; SEQ ID NO:51; SEQ ID NO:52; SEQ ID NO:57; SEQ ED NO:62; SEQ ID 
NO:67; and SEQ ID NO:72. 

25 Those of ordinary skill will recognize that oligonucleotides complementary to the 

polymorphic regions described herein must be capable of hybridizing to the polymorphic 
regions under conditions of stringency such as those employed in primer extension-based 
sequence determination methods, restriction site analysis, nucleic acid amplification 
methods, ligase-based sequencing methods, methods based on enzymatic detection of 

30 mismatches, microarray-based sequence determination methods, and the like. The 
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oligonucleotides of the invention may be synthesized using known methods and 
machines, such as the ABF M 3900 High Throughput DNA Synthesizer and the Expedite™ 
8909 Nucleic Acid Synthesizer, both of which are available from Applied Biosystems 
(Foster City,CA). 

5 The oligonucleotides of the invention may be used, without limitation, as in situ 

hybridization probes or as components of diagnostic assays. Numerous oligonucleotide- 
based diagnostic assays are known. For example, primer extension-based nucleic acid 
sequence detection methods are disclosed in U.S.Pat.Nos. 4,656,127; 4,851,331; 
5,679,524; 5,834,189; 5,876,934; 5,908,755; 5,912,118; 5,976,802; 5,981,186; 6,004,744; 

10 6,013,431; 6,017,702; 6,046,005; 6,087,095; 6,210,891; WO 01/20039; and the like. 
Primer extension-based nucleic acid sequence detection methods using mass 
spectrometry are described in U.S.Pat.Nos. 5,547,835; 5,605,798; 5,691,141; 5,849,542; 
5,869,242; 5,928,906; 6,043,031; 6,194,144, and the like. The oligonucleotides of the 
invention are also suitable for use in ligase-based sequence determination methods such 

15 as those disclosed in U.S.Pat.Nos. 5,679,524 and 5,952,174, WO 01/27326, and the like. 
The oligonucleotides of the invention may be used as probes in sequence determination 
methods based on mismatches, such as the methods described in U.S.PatNos. 5,851,770; 
5,958,692; 6,110,684; 6,183,958; and the like. In addition, the oligonucleotides of the 
invention may be used in hybridization-based diagnostic assays such as those described 

20 in U.S.Pat.Nos. 5,891,625; 6,013,499; and the like. 

The oligonucleotides of the invention may also be used as components of a 
diagnostic micro array. Methods of making and using oligonucleotide microarrays 
suitable for diagnostic use are disclosed in U.S.PatNos. 5,492,806; 5,525,464; 5,589,330; 
5,695,940; 5,849,483; 6,018,041; 6,045,996; 6,136,541; 6,142,681; 6,156,501; 6,197,506; 

25 6,223,127; 6,225,625; 6,229,911; 6,239,273; WO 00/52625; WO 01/25485; WO 
01/29259; and the like. 

The invention is also embodied in a kit comprising at least one oligonucleotide 
primer pair of the invention. When the kit is used for amplification and detection of 
CYP3A4 polymorphisms, it will comprise an oligonucleotide primer pair suitable for 

30 amplification of the polymorphic region corresponding to position 461 of SEQ ID NO: 1. 
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Specific primer pairs in this embodiment are selected from the group consisting of SEQ 
ID NO:7 and SEQ ID NO:8; and SEQ ID NO;9 and SEQ ID NO: 10. This embodiment of 
the kit of the invention may optionally comprise a sequence determination 
oligonucleotide selected from the group consisting of SEQ ID NO:ll; SEQ ID NO: 12; 
5 SEQ ID NO:13; SEQ ID NO:14; SEQ ID NO:15; SEQ ID NO:16; SEQ ID NO:17; and 
SEQ ID NO: 18. 

When the kit of the invention is used for amplification and detection of 
polymorphisms in the 5' flanking region of CYP2C9, it will comprise at least one 
oligonucleotide primer pair, wherein the primer pair is capable of amplifying a 

10 polymorphic region selected from the group consisting of the polymorphic region 

corresponding to position 957 of SEQ ID NO:6; the polymorphic region corresponding to 
position 1049 of SEQ ID NO:6; the polymorphic region corresponding to position 1164 
of SEQ ID NO:6; the polymorphic region corresponding to position 1526 of SEQ ID 
NO:6; the polymorphic region corresponding to position 1661 of SEQ ID NO:6; and the 

15 polymorphic region corresponding to position 1662 of SEQ ID NO: 6. This embodiment 
may optionally further comprise a sequence determination oligonucleotide for detecting a 
polymorphic variant at any or all of the polymorphic sites corresponding to positions 957, 
1049, 1164, 1526, 1661 and 1662 of SEQ ID NO:6. 

The kit of the invention may also comprise a polymerizing agent, for example, a 

20 thermostable nucleic acid polymerase such as those disclosed in U.S.Pat.Nos. 4,889,818; 
6,077,664, and the like. The kit of the invention may also comprise chain elongating 
nucleotides, such as dATP, dTTP, dGTP, dCTP, and dlTP, including analogs of dATP, 
dTTP, dGTP, dCTP and dTTP, so long as such analogs are substrates for a thermostable 
nucleic acid polymerase and can be incorporated into a growing nucleic acid chain. The 

25 kit of the invention may also include chain terminating nucleotides such as ddATP, 

ddTTP, ddGTP, ddCTP, and the like. In a preferred embodiment, the kit of the invention 
comprises at least two oligonucleotide primer pairs, a polymerizing agent, chain 
elongating nucleotides, at least two sequence determination oligonucleotides and at least 
one chain terminating nucleotide. The kit of the invention may optionally include 

30 buffers, vials, microtiter plates, and instructions for use. 

12 
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The examples set forth below are provided as illustration and are not intended to 
limit the scope and spirit of the invention as specifically embodied therein. 
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EXAMPLE 1 

IDENTIFICATION OF CYP3A4 POLYMORPHISM 

The study was performed in accordance with the principles stated in the 
Declaration of Helsinki as reviewed in Tokyo 1975 and Venice 1983, Hong Kong 1989 
5 and Somerset West 1996. Ten samples (Swedish Caucasians) were selected and used for 
identification of polymorphisms in the 5' flanking region of CYP3A4. 

White blood cells isolated from a blood sample drawn from the brachial vein 
serve as the source of the genomic DNA for the analyses. The DNA was extracted by 
guanidine thiocyanate method or QIAamp Blood Kit (QIAGEN, Venlo, The 

10 Netherlands). The genes included in the study were amplified by PCR and the DNA 
sequences were determined by full sequencing. All genetic analyses were performed 
according to Good Laboratory Practice and Standard Operating Procedures. Case Report 
Forms were designed and used for clinical and genetic data collection. Data was entered 
and stored in a relational database at Gemini Genomics AB, Uppsala. To secure 

15 consistency between the Case Report Forms and the database, data was checked either by 
double data entry or proofreading. After a Clean File was declared the database was 
protected against changes. By using the program Stat/Transfer™ the database was 
transferred to SAS data sets. The SAS™ system was used for tabulations and statistical 
evaluations. Genotypes were also correlated against the metabolic ratio. 

20 PCR-fragments were amplified with TaqGOLD polymerase (Applied Biosystems) 

using Robocycler (Stratagene) or GeneAmp PCR system 9700 (Applied Biosystems). 
Preferentially, the amplified fragments were 300-400 bp, and the region to be read did not 
exceed 300 bp. PCR reactions were carried out according to the basic protocol set forth in 
Table 1, with modifications as indicated in Table 2 for specific primer pairs, which are 

25 shown in Table 3. For the GeneAmp PCR 9700 machine the profile used was 10 minutes at 
95°, 40 x (45 seconds at 90°, 45 seconds at 60°, 45 seconds at 72°), 5 minutes at 72° and 22° 
until removed. 

30 
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Table 1 



Solution 


mock concentration 


JrCK (pi) 


H 2 0 




33.2 


PCR buffer 


lOx 


5.0 


MgCl 2 


25 mM 


2.0 


dNTP 


2.5 mM 


2.5 


primer 1 


IOjiM 


1.0 


primer 2 


lOjiM 


1.0 


Taq-gold 
polymerase 


5n/Ml 


0.3 


DNA samples 


2ng/pl 


5.0 


TOTAL 




50.0 



Table 2 



SEQID 
NO:s 


Polymorphic 
Site 


Modification from basic protocol (Table 1) 


Detection method 


7,8 


461 


62° annealing temperature 


Full sequencing 


9, 10 


461 


3 \Xl MgCl 2 , 58° annealing temperature, 50 
cycles 


Full sequencing 



Table 3 



Polymorphic 
Site 


Primer Pair 


461 


SEQ ID NO:7 CCAGCCTGAAAGTGCAGAGA 
SEQ ID NO:8 TCTTAGAGTCTTTCCTCACCAAACT 


461 


SEQ ID NO:9 CATGCCCTGTCTCTCCTTTA 
SEQ ID NO: 10 CCATCCCCTTCATGCAATC 



The optimized condition specified in Table 2 were required to distinguish CYP3A4 
10 from the closely related gene-family members CYP3A5, and CYP3A7. Use of the basic 

protocol will lead to problems when amplifying CFP3A4-specific amplicons of 300-400 bp 
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containing the polymorphisms of interest, unless a nested PCR approach is carried out. The 
nested PCR approach was not used because of the high risk of contamination when using a 
nested PCR approach and the high risk of typing errors as a consequence. The modifications 
shown in Table 2 were optimized and reaction parameters were balanced in such a way that 
nested PCR was avoided. 

For full sequencing, one of the PCR-primers in a primer pair was designed for 
sequencing by addition of a 29 nucleotide tail complementary to Ml 3 at its 5 '-end, 
namely the nucleotides AGTCACGACGTTGTAAAACGACGGCCAGT. Thus, the 
entire PCR-product was sequenced from the tailed PCR-primer. 

The additional oligonucleotides set forth in Tables 4 through 7 were identified as 
being suitable for detection of the SNP at positions 461 of the 5' flanking region of the 
CYP3A4 gene as depicted in SEQ ID NO:l. 

Table 4 sets forth oligonucleotides representing the coding (sense) strand 
complementary to the polymorphic region corresponding to the novel polymorphism 
found in the study population. The underlined letter indicates polymorphic position in the 
sequence context. All sequences are shown in 5' to 3' direction. 

Table 4 



Polymorphic 
Site 


Sequence 


Note 


461 


SEQ ID NO: 1 1 : AGCACCCTGGT 
SEQ ID NO: 12: AGCACGCTGGT 


C variant 
G variant 



Table 5 sets forth oligonucleotides representing the non-coding (anti-sense) strand 
complementary to the polymorphic region corresponding to the novel polymorphism 
found in the study population. The underlined letter indicates polymorphic position in the 
sequence context. All sequences are shown in 5' to 3' direction. 

Table 5 



Polymorphic 
Site 


Sequence 


Note 


461 


SEQ ID NO: 13: ACCAGGGTGCT 
SEQ ID NO: 14: ACCAGCGTGCT 


Antisense G variant 
Antisense C variant 
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The sequences of Table 6 represent the 5 5 -sequence to the novel polymorphic site 
on the coding (sense) strand (SEQ ID NO: 15) and non-coding (anti-sense) strand (SEQ 
ID NO:s 16). All sequences are shown in 5' to 3' direction. 



Table 6 



Polymorphic 
Site 


Sequence 


Note 


461 


SEQ ID NO: 15 : GTGTGT AC AGC 
SEQ ID NO: 16: GCTGTACACAC 


Sense 5* 
Antisense 5* 



The sequences of Table 7 represent the 3 '-sequence to the novel polymorphic site 
on the non-coding (anti-sense) strand (SEQ ID NO: 17) and the coding (sense) strand 
(SEQ ID NO: 18). All sequences are shown in 5' to 3' direction. 



Table 9 



Polymorphic 
Site 


Sequence 


Note 


461 


SEQ ID NO:17: TGGTCCCTACC 
SEQ ID NO:18: GGTAGGGACCA 


Antisense 3' 
Sense 3' 
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EXAMPLE 2 

IDENTIFICATION OF CYP2C9 POLYMORPHISMS 

The study was performed in accordance with the principles stated in the 
Declaration of Helsinki as reviewed in Tokyo 1975 and Venice 1983, Hong Kong 1989 
5 and Somerset West 1996. Ten samples (Swedish Caucasians) were selected and used for 
identification of polymorphisms in the 5' flanking region of CYP2C9. 

White blood cells isolated from a blood sample drawn from the brachial vein 
serve as the source of the genomic DNA for the analyses. The DNA is extracted by 
guanidine thiocyanate method or QIAamp Blood Kit (QIAGEN, Venlo, The 

10 Netherlands). The genes included in the study were amplified by PGR and the DNA 
sequences were determined by full sequencing. All genetic analyses were performed 
according to Good Laboratory Practice and Standard Operating Procedures. Case Report 
Forms were designed and used for clinical and genetic data collection. Data was entered 
and stored in a relational database at Gemini Genomics AB, Uppsala. To secure 

15 consistency between the Case Report Forms and the database, data was checked either by 
double data entry or proofreading. After a Clean File was declared the database was 
protected against changes. By using the program Stat/Transfer™ the database was 
transferred to SAS data sets. The SAS™ system was used for tabulations and statistical 
evaluations. Genotypes were also correlated against the metabolic ratio. 

20 PCR-fragments were amplified with TaqGOLD polymerase (Applied Biosystems) 

using Robocycler (Stratagene) or GeneAmp PCR system 9700 (Applied Biosystems). 
Preferentially, the amplified fragments were 300-400 bp, and the region to be read did not 
exceed 300 bp. PCR reactions were carried out according to the basic protocol set forth in 
Table 10, with modifications as indicated in Table 11 for specific primer pairs, which are 

25 shown in Table 12. For the GeneAmp PCR 9700 machine the profile used was 10 minutes 
at 95°, 40 x (45 seconds at 90°, 45 seconds at 60°, 45 seconds at 72°), 5 minutes at 72° and 
22° until removed. 



30 



18 



WO 02/18641 



PCT/IB01/01580 



Table 10 



{Solution 


Stock Concentration 


PCR 


H 2 0 




33.2 


PCR buffer 


lOx 


5.0 


MgCl 2 


25 mM 


2.0 


dNTP 


2.5 mM 


2.5 


primer 1 


lOuM 


1.0 


primer 2 


lOuM 


1.0 


Taq-gold 
polymerase 


5u/m-1 


0.3 


DNA samples 


2ng/|xl 


5.0 


TOTAL 




50.0 



Table 11 



SEQBD 
NO:s 


Polymorphic Site 


Modification from basic protocol (Table 10) 


Detection method 


19, 20 


957 


58° annealing temperature 


Full sequencing 


21,22 


957 & 1049 


3 [il MgCl 2 , 62° annealing temperature 


Full sequencing 


23,24 


957, 1049 & 1164 


58° annealing temperature 


Full sequencing 


25, 26 


1164 


3 \ll MgCl 2 , 62° annealing temperature , 50 
cycles 


Full sequencing 


27, 28 


1526, 1661 & 1662 




Full sequencing 


29, 30 


1661 & 1662 


3 ul MgCl 2 , 62° annealing temperature , 50 
cycles 


Full sequencing 


31,32 


1661 & 1662 




Full sequencing 



5 



10 
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Table 12 



Polymorphic 
Site 


Primer Pair 


957 


SEQ ID NO: 19 CACTAGGGAATTTAGAACAAATATG 
SEQ ID NO:20 GCACAGAAAGCAAAGGAAATTAT 


957 & 1049 


SEQ ID NO:21 TGTATTTAGATCCTCAACTCAGTATGT 
SEQ ID NO:22 GGATCTCCCTTCTCCATCACT 


957, 1049 8c 1164 


SEQ ID NO:23 GGTCCATTTAGTGATTTCCCTAC 
SEQ ID NO:24 ATACACCACATTTATTCTGTTCATA 


1164 


SEQ ID NO:25 CCAAATTTTTCCCTCAGTTACA 
SEQ ID NO:26 TTGGTGCCACACAGCTCATA 


1526, 1661 & 1662 


SEQ ID NO:27 GCCTTCAGGAATTTTTTTTA 

SEQ ID NO:28 CCAGTTGGGAATATATGATTTAACA 


1661 & 1662 


SEQ ID NO:29 GCTGCTGTATTTTTAGTAGGCTATA 
SEQ ID NO:30 CGTTCCATTGTCCACTCTGTAC 


1661 & 1662 


SEQ ID NO:3 1 TCAAGGC AGCTCTGGTGT AA 

SEQ ID NO:32 AGTTGGGAATATATGATTTAACAGA 



The optimized condition specified in Table 11 were required to distinguish CYP2C9 
from the closely related gene-family members CYP2C8, CYP2C18 and CYP2C19. Use of 
5 the basic protocol will lead to problems when amplifying CFP2C9-specific amplicons of 
300-400 bp containing the polymorphisms of interest, unless a nested PGR approach is 
carried out. The nested PCR approach was not used because of the high risk of 
contamination when using a nested PCR approach and the high risk of typing errors as a 
consequence. The modifications shown in Table 11 were optimized and reaction parameters 

10 were balanced in such a way that nested PCR was avoided. 

For full sequencing, one of the PCR-primers in a primer pair was designed for 
sequencing by addition of a 29 nucleotide tail complementary to M13 at its 5'-end, 
namely the nucleotides AGTCACGACGTTGTAAAACGACGGCCAGT, Thus, the 
entire PCR-product was sequenced from the tailed PCR-primer. 

15 The additional oligonucleotides set forth in Tables 13 through 16 were identified 

as being suitable for detection of the SNPs at positions 957, 1049, 1164, 1526, 1661 
and/or 1662 of the 5' flanking region of the CYP2C9 gene as depicted in SEQ ID NO:6. 
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Table 13 sets forth oligonucleotides representing the coding (sense) strand 
complementary to the polymorphic region corresponding to the polymorphisms found in 
the study population. The underlined letter indicates polymorphic position in the 
sequence context. All sequences are shown in 5' to 3' direction. 
5 Table 13 



Polymorphic 
Site 


Sequence 


Note 


95J 


SEQ ID NO:33 
SEQ ID NO:34 


ATCTTCTATTG 
ATCTTTTATTG 


C variant 
T variant 


1049 


SEQ ID NO:35, 
SEQ ID NO:36< 


ACAATAGAAAG 
ACAATGGAAAG 


A variant 

G variant 1 


1164 


SEQ ID NO:37 
SEQ ID NO:38 


ATGGAGAAGGG 
ATGGAAAAGGG 


G variant 
A variant 


1526 


SEQ ID NO:39 
SEQ ID NO:40 


TTAATGGTAAA 
TTAATTGTAAA 


G variant 
T variant 


1661 & 1662 


SEQ ED NO:41 
SEQ ID NO:42 


GGATTTCATTAT 
GGATTAAATTAT 


TC variants 
AA variants 



Table 14 sets forth oligonucleotides representing the non-coding (anti-sense) 
10 strand complementary to the polymorphic region corresponding to the polymorphisms 

found in the study population. The underlined letter indicates polymorphic position in the 
sequence context. All sequences are shown in 5' to 3' direction. 

Table 14 



Polymorphic 
Site 


Sequence 


Note 


957 


SEQ ID NO:43 
SEQBDNO:44 


CAATAGAAGAT 
CAATAAAAGAT 


Antisense G variant 
Antisense A variant 


1049 


SEQ ID NO:45 
SEQ ID NO:46 


CTTTCTATTGT 
CTTTCCATTGT 


Antisense T variant 
Antisense C variant 


1164 


SEQ ID NO:47 
SEQ ID NO:48 


CCCTTCTCCAT 
CCCTTTTCCAT 


Antisense C variant 
Antisense T variant i 


1526 


SEQ ID NO:49 
SEQIDNO:50 


TTTACCATTAA 
TTTACAATTAA 


Antisense C variant 
Antisense A variant 


1661 & 
1662 


SEQIDNO:51 
SEQ ID NO:52 


ATAATGAAATCC 
ATAATTTAATCC 


Antisense GA variants 
Antisense TT variant 



15 

The sequences of Table 15 represent the 5 '-sequence to the polymorphic sites on 
the coding (sense) strand (SEQ ID NO:s 53-57) and non-coding (anti-sense) strand (SEQ 
ID NO:s 58-67). All sequences are shown in 5' to 3' direction. 
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Table 15 



Polymorphic 
Site 


Sequence 


Note 


957 


SEQ ID NO:53 
SEQ ID NO:58 


TACCTCCCATC 
GATGGGAGGTA 


Sense 5* 
Antisense 5' 


1049 


SEQ ID NO:54 
SEQ ID NO:59 


AACCAAAAACA 
TGTTTTTGGTT 


Sense 5' 
Antisense 5' 


1164 


SEQ ID NO:55 
SEQ ID NO:60 


CTGCAGTGATG 
CATCACTGCAG 


Sense 5* 
Antisense 5' 


1526 


SEQ ID NO:56 
SEQIDNO:61 


TAGGGGGTTTA 
TAAACCCCCTA 


Sense 5' 
Antisense 5' 


1661 & 1662 


SEQ ID NO:57 
SEQ ID NO:62 


ATTTGAAAGGA 
TCCTTTCAAAT 


Sense 5' 
Antisense 5' 



5 The sequences of Table 16 represent the 3 '-sequence to the polymorphic sites on 

the non-coding (anti-sense) strand (SEQ ID NO:s 63-67) and the coding (sense) strand 
(SEQ ID NO:s 68-72). All sequences are shown in 5' to 3' direction. 

Table 16 

10 



Polymorphic 
Site 


Sequence 


Note 


957 


SEQ ID NO:63 
SEQ ID NO:68 


TGTGGATGCAA 
TTGCATCCACA 


Antisense 3' 
Sense 3' 


1049 


SEQ ID NO:64 
SEQIDNO:69 


CATGGCTGCTT 
AAGCAGCCATG 


Antisense 3' 
Sense 3' 


1164 


SEQ ID NO:65 
SEQ ID NO:70 


AGGGATCTCCC 
GGGAGATCCCT 


Antisense 3' 
Sense 3 ' 


1526 


SEQ ID NO:66 
SEQ ID NO:71 


TAAACACCTTT 
AAAGGTGTTTA 


Antisense 3' 
Sense 3' 


1661 & 1662 


SEQ ID NO:67 
SEQ ID NO:72 


TGTTCTTTATA 
TATAAAGAACA 


Antisense 3' 
Sense 3' 
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CLAIMS 

1. An oligonucleotide primer pair suitable for amplifying a polymorphic 
region of a 5 'flanking region of a CYP3A4 gene, wherein the polymorphic region 
corresponds to position 816 of SEQ ID NO:L 

5 

2. The primer pair of claim 1, having sequences selected from the group 
consisting of SEQ ID NO: 7 and SEQ ID NO:8 and SEQ ID NO:9 and SEQ ID NO: 10. 

3. A sequence determination oligonucleotide for detecting a polymorphic site 
10 in a 5' flanking region of a CYP3A4 gene, said oligonucleotide being complementary to 

the polymorphic region corresponding to position 461 of SEQ ID NO:L 

4. The oligonucleotide of claim 3, comprising a sequence selected from the 
group consisting of SEQ ID NO:ll; SEQ ID NO:12; SEQ ID NO:13; SEQ ID NO:14; 

15 SEQ ID NO:15; SEQ ID NO:16; SEQ ID NO:17; and SEQ ID NO:18. 

5. A kit comprising at least one oligonucleotide primer pair capable of 
amplifying the region corresponding to position 461 of SEQ ID NO:L 

20 6. The kit of claim 5, wherein the primer pair comprises sequences selected 

from the group consisting of SEQ ID NO: 7 and SEQ ID NO:8 and SEQ ID NO:9 and 
SEQ ID NO: 10. 

7. The kit of claim 5, further comprising a sequence determination 

25 oligonucleotide complementary to the polymorphic region corresponding to position 461 
of SEQIDNO:l. 

8. The kit of claim 7, wherein the oligonucleotide comprises a sequence 
selected from the group consisting of SEQ ID NO:ll; SEQ ID NO: 12; SEQ ID NO: 13; 

30 SEQ ID NO: 14; SEQ ID NO: 15; SEQ ID NO: 16; SEQ ID NO: 17; and SEQ ID NO: 18. 
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9. An oligonucleotide primer pair suitable for amplifying a polymorphic 
region of a 5' flanking region of a CYP2C9 gene, wherein the polymorphic region 
corresponds to position 957 of SEQ ID NO:6; position 1049 of SEQ ID NO:6; position 
1164 of SEQ ID NO:6; position 1526 of SEQ ID NO:6; position 1661 of SEQ ID NO:6; 
5 and position 1662 of SEQ ID NO:6. 



10. The primer pair of claim 9, having a sequence selected from the group 
consisting of SEQ ID NO: 19 and SEQ ID NO:20; SEQ ID NO:21 and SEQ ID NO:22; 
SEQ ID NO:23 and SEQ ID NO:24; SEQ ID NO:25 and SEQ ID NO:26; SEQ ID NO:27 
10 and SEQ ID NO:28; SEQ ID NO:29 and SEQ ID NO:30; and SEQ ID NO:31 and SEQ 
ID NO:32. 



11. A sequence determination oligonucleotide for detecting a polymorphic site 
in a 5' flanking region of a CYP2C9 gene, said oligonucleotide comprising a sequence 

15 selected from the group consisting of an oligonucleotide complementary to the 

polymorphic region corresponding to position 957 of SEQ ID NO:6; an oligonucleotide 
complementary to the polymorphic region corresponding to position 1049 of SEQ ID 
NO:6; an oligonucleotide complementary to the polymorphic region corresponding to 
position 1164 of SEQ ID NO: 6; an oligonucleotide complementary to the polymorphic 

20 region corresponding to position 1526 of SEQ ID NO:6; an oligonucleotide 

complementary to the polymorphic region corresponding to position 1661 of SEQ ID 
NO:6; and an oligonucleotide complementary to the polymorphic region corresponding to 
position 1662 of SEQ ID NO:6. 



25 12. The oligonucleotide of claim 11, comprising a sequence selected from the 

group consisting of SEQ ID NO:33; SEQ ID NO:34; SEQ JD NO:35; SEQ ID NO:36; 
SEQ ID NO:37; SEQ ID NO:38; SEQ ID NO:39; SEQ ID NO:40; SEQ ID NO:41; SEQ 
ID NO:42; SEQ ID NO:43; SEQ ID NO:44; SEQ ID NO:45; SEQ ID NO:46; SEQ ID 
NO:47; SEQ ID NO:48; SEQ ID NO:49; SEQ ID NO:50; SEQ ID NO:51; SEQ ID 

30 NO:52; SEQ ID NO:53; SEQ ID NO:54; SEQ ID NO:55; SEQ ID NO:56; SEQ ID 
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NO:57; SEQ ID NO:58; SEQ ID NO:59; SEQ ID NO:60; SEQ JD NO:61; SEQ ID 
NO:62; SEQ ID NO:63; SEQ ID NO:64; SEQ ID NO:65; SEQ ID NO:66; SEQ ID 
NO:67 and SEQ ID NO:68. 

5 13. A kit comprising at least one oligonucleotide primer pair, wherein the 

primer pair is capable of amplifying a polymorphic region selected from the group 
consisting of the polymorphic region corresponding to position 957 of SEQ ID NO:6; the 
polymorphic region corresponding to position 1049 of SEQ ID NO:6; the polymorphic 
region corresponding to position 1164 of SEQ ID NO:6; the polymorphic region 
10 corresponding to position 1526 of SEQ ID NO:6; the polymorphic region corresponding 
to position 1661 of SEQ ID NO:6; and the polymorphic region corresponding to position 
1662 ofSEQIDNO:6. 
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FIGURE 1 

1 CTGCAGTGAC CACTGCCCCA TCATTGCTGG CTGAGGTGGT TGGGGTCCAT CTGGCTATCT 
61 GGGCAGCTGT TCTCTTCTCT CCTTTCTCTC CTGTTTCCAG ACATGCAGTA TTTCCAGAGA 
121 GAAGGGGC C A CTCTTTGGCA AAGAACCTGT CTAACTTGCT ATCTATGGCA GGACCTTTGA 
181 AGGGTTCACA GGAAGCAGCA CAAATTGATA CTATTCCACC AAGC CATC AG CTCCATCTCA 
241 TCCATGCCCT GTCTCTCCTT TAGGGGTCCC CTTGCCAACA GAATCACAGA GGACCAGCCT 
3 01 GAAAGTGCAG AGACAGCAGC TGAGGCACAG CCAAGAGCTC TGGCTGTATT AATGACCTAA 
3 61 GAAGTC AC C A GAAAGTCAGA AGGATGCATA GCAGAGGCCC AGCAATCTCA GCTAAGTCAA 
421 CTCCACCAGC CTTTCTAGTT GCCCACTGTG TGTACAGCAC SCTGGTAGGG ACCAGAGCCA 
481 TGACAGGGAA TAAGACTAGA CTATGCCCTT GAGGAGCTCA CCTCTGTTCA GGGAAACAGG 
541 CGTGGAAACA CAATGGTGGT AAAGAGGAAA GAGGACAATA GGATTGCATG AAGGGGATGG 
601 AAAGTGC C C A GGGGAGGAAA TGGTTACATC TGTGTGAGGA GTTTGGTGAG GAAAGAC TCT 
661 AAGAGAAGGC TCTGTCTGTC TGGGTTTGGA AGGATGTGTA GGAGTCTTCT AGGGGGCACA 
721 GGCACACTCC AGGCATAGGT AAAGATCTGT AGGTGTGGCT TGTTGGGATG AATTTCAAGT 
7 81 ATTTTGGAAT GAGGACAGCC ATAGAGACAA GGGCARGAGA GAGGC GATTT AATAGATTTT 
841 ATGCCAATGG CTCCACTTGA GTTTC TGAT A AGAACCCAGA ACCCTTGGAC TCCCCAGTAA 
901 CATTGATTGA GTTGTTTATG ATACCTCATA GAATATGAAC TCAAAGGAGG TCAGTGAGTG 
9 61 GTGTGTGTGT GATTCTTTGC CAACTTCCAA GGTGGAGAAG CCTCTTCCAA CTGCAGGCAG 
1021 AGCACAGGTG GCCCTGCTAC TGGCTGCAGC TCCAGCCCTG CCTCCTTCTC TAGCATATAA 
1081 ACAATCCAAC AGCCTCACTG AATCACTGCT GTGCAGGGCA GGAAAGCTCC ATGCACATAG 
1141 CCCAGCAAAG AGCAACACAG AGCTGAAAGG AAGACTCAGA GGAGAGAGAT AAGTAAGGAA 
12 01 AGTAGTGATG GCTCTCATCC CAGACTTGGC CATGGAAACC TGGCTTCTCC TGGCTGTCAG 
12 61 CCTGGTGCTC CTCTATCTGT GAGTAACTGT TCAGGCTCCT CTTCTCTGTT TCTTGGACTT 
1321 GGGGTC GTAA TCAGGCCTCT CTTTT 
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FIGURE 2 

1 GATCTCAGAT ATCCCTTCTA TCTACACATT ATCTATAATT CTTTCTTTCT GTAAACTGAA 
61 AGGTCCTAGA AGGAGCC GC A GCTCAGCAGG AGAGAGGAGG AGCTGAGCTG GGACCCCTAC 
121 CTCCTGAGGA ATGAAATGAT TATTATAAAG ACAGCAACCG AGCTTATTTT ACCCAAAATA 
181 AGGTAGTATA TTTCTGTTAG AGTTTAGAGT TTCATGAGTC AGGGACCAAG TTATTGCTTT 
241 TCTTTGCCCT GTATAAAGGC TTCTCCAAGG CCTTTGACTT AC CTAAGTAC TAAATGTTAT 
3 01 AAAACCAAAC TCTTCTGACC TCTCAATCTA GTCAACTGGG GCTGTAATTA TTAATGAAAT 
3 61 TAATGTTTAT TTTGAAAATA ATTTAC TAGA CTGAATTACG AAATCCTGAA TCATTGTACA 
421 CTATCAGTAA ATATTGGTGG ACCCAACTGA AC TGAATGTT TTGCTTGAAA TGAAACCTTT 
481 GAGATGCAGG GCTTATGGGT TCTAGTCCCA GCTCTAGCAC TAGCAGACAG CATGTTCTTG 
541 GCTAAGATAC TGAATCTTCA AGGCTCAGCT TCCTCATTCC GGAAATGGGT CAATTTTATT 
601 GTAAGCAGAG GTAATTGAGA GATTCAAAAG GGACATGAGG TGTAACAATT CTCTGTAAAT 
661 TGTTAGAATC CCTGTTAAAA ATGACCAGTA AAGCTTTGTG CAACTGTGTC TTGACATAAC 
721 TTTATTTTTC TTAATAAAAG AAATGGAAAT AACCTCACTA GGGAATTTAG AACAAATATG 
7 81 ATGATATCTT TAAAGAAAAT GGCTTTGCAC AAGTATTGAC ATTAATGATC TAGTAAAGTG 
841 TATCTTTCTA GTTGTATTTA GATCCTCAAC TCAGTATGTC AGCTCCTGTT AAGGTC TATA 
901 CATTGTGGTG GTTCTGTGCT GTGGGTCCAT TTAGTGATTT CCCTACCTCC CATCTTYTAT 
961 TGCATCCACA ACTGTGGTTC TGTCCATAAT TTCCTTTGCT TTCTGTGCAT TATTACATCA 
1021 TATCTGAAAA TGAGAAAC C A AAAACAATRG AAAGCAGCCA TGTCTGGAGG TGACTGGGGG 
1081 GTCGAGAAGC CCTAGTTTCT CAAACCCTTA GCACCAAATT TTTCCCTCAG TTACACTGAG 
1141 CGTTTC AC TT CTGCAGTGAT GGARAAGGGA GATCCCTTAT TTCTTCTCAT GAGCATCTCT 
12 01 GGTGCTGTTT CCCTTAGAGA CAAATAAGGG GTTCTATTTA ATGTGAAGCC TGTTTTATGA 

12 61 ACAGAATAAA TGTGGTGTAT ATTCAGAATA AC TAATGTTT GGAAGTTGTT TTATTTTTGC 
1321 TAAAAATTGT TCTCAAGGCA GCTCTGGTGT AAGAGATAAT ACACCACGAT GGGCATCAGA 

13 81 AGACCTCAGC TCAAATCCCA GTTCTGCCAG CTATGAGCTG TGTGGCACCA ACAGGTGTCC 
1441 TGTTCTCCCA GGGTCTCCCT TTTCCCATTT GAAAAATAAA AAATAACAAT TCCTGCCTTC 
1501 AGGAATTTTT TTTAGGGGGT TTAATKGTAA AGGTGTTTAT ATCTGCTAAG GTAATTTACT 
1561 TGATATATGT TTGGTTATTT AAGATATATG AGTTATGTTA GCTATTTCAT GTTTAGGCTG 
1621 CTGTATTTTT AGTAGGC TAT ATTAAATATT TGAAAGGATT WMATTATAAA GAACAAAGTC 
1681 TCCTAATCTT TGATATAGCA TTGACATACT TTTTAAATAT ACAAGGCATA GAATATGGCC 
1741 ATTTCTGTTA AATCATATAT TCCCAACTGG TTATTAATCT AAGAATTCAG AATTTTGAGT 
1801 AATTGC TTTT GCATCAGATT ATTTAC TTC A GTGCTCTCAA TTATGATGGT GCATTAGAAC 
1861 CATC TGGGTT AACATTTGTT TTTTATTACC AATACCTAGG CTCCAACCAA GTACAGTGAA 
1921 AC TGGAATGT ACAGAGTGGA CAATGGAACG AAGGAGAACA AGACCAAAGG ACATTTTATT 
1981 TTTATC TGTA TCAGTGGGTC AAAGTC CTTT CAGAAGGAGC ATATAGTGGA CCTAGGTGAT 
2041 TGGTCAATTT ATCCATCAAA GAGGCACACA CCGAATTAGC ATGGAGTGTT ATAAAAGGCT 
2101 TGGAGTGCAA GCTCATGGTT GTCTTAACAA GAAGAGAAGG C TTC AATGGA TTCTCTTGTG 
2161 GTCCTTGTGC TCTGTCTCTC ATGTTTGCTT CTCCTTTCAC TCTGGAGACA GAGCTCTGGG 
2221 AGAGGAAAAC TCCCTCCTGG CCCCACTCCT CTCCCAGTGA TTGGAAATAT CCTACAGATA 
2281 GGTATTAAGG ACATCAGCAA ATCCTTAACC AATGTAAGTA TGCTCCTTCA GTGGCTTGCA 
2341 AAAGGTAAGT AAATTC AC C T GTATTTTTTA AATAAAGTGT ATCCCTAGAG GTACATGTTA 
2401 CAAGAGGTAA TGGTAAAGTA AAAT AC TTTG AAAGGCTT 
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SEQUENCE LISTING 

<110> Gemini Genomics pic 

<120> Detection of CYP3A4 and CYP2C9 Polymorphisms 

<130> GG119.3PCT 

<150> GB 0021286.0 

<151> 2000-08-30 

<160> 72 

<170> Patentln version 3.1 

<210> 1 

<211> 1345 

<212> DNA 

<213> homo sapiens 



<400> 1 
ctgcagtgac 


cactgcccca 


tcattgctgg 


ctgaggtggt 


tggggtccat 


ctggctatct 


60 


gggcagctgt 


tctcttctct 


cctttctctc 


ctgtttccag 


acatgcagta 


tttccagaga 


120 


gaaggggcca 


ctctttggca 


aagaacctgt 


ctaacttgct 


atctatggca 


ggacctttga 


180 


agggttcaca 


ggaagcagca 


caaattgata 


ctattccacc 


aagccatcag 


ctccatctca 


240 


tccatgccct 


gtctctcctt 


taggggtccc 


cttgccaaca 


gaatcacaga 


ggaccagcct 


300 


gaaagtgcag 


agacagcagc 


tgaggcacag 


ccaagagctc 


tggctgtatt 


aatgacctaa 


360 


gaagtcacca 


gaaagtcaga 


aggatgcata 


gcagaggccc 


agcaatctca 


gctaagtcaa 


420 


ctccaccagc 


ctttctagtt 


gcccactgtg 


tgtacagcac 


sctggtaggg 


accagagcca 


480 


tgacagggaa 


taagactaga 


ctatgccctt 


gaggagctca 


cctctgttca 


gggaaacagg 


540 


cgtggaaaca 


caatggtggt 


aaagaggaaa 


gaggacaata 


ggattgcatg 


aaggggatgg 


600 


aaagtgccca 


ggggaggaaa 


tggttacatc 


tgtgtgagga 


gtttggtgag 


gaaagactct 


660 
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aagagaaggc 


tctgtctgtc 


tgggtttgga aggatgtgta ggagtcttct 


agggggcaca 


720 


ggcacactcc 


aggcataggt 


aaagatctgt 


aggtgtggct tgttgggatg 


aatttcaagt 


780 


attttggaat 


gaggacagcc 


atagagacaa gggcargaga 


gaggcgattt 


aatagatttt 


840 


atgccaatgg 


ctccacttga 


gtttctgata agaacccaga 


acccttggac 


tccccagtaa 


900 


cattgattga 


gttgtttatg 


atacctcata 


gaatatgaac 


tcaaaggagg 


tcagtgagtg 


960 


gtgtgtgtgt 


gattctttgc 


caacttccaa 


ggtggagaag 


cctcttccaa 


ctgcaggcag 


1020 


agcacaggtg 


gccctgctac 


tggctgcagc 


tccagccctg 


cctccttctc 


tagcatataa 


1080 


acaatccaac 


agcctcactg 


aatcactgct 


gtgcagggca 


ggaaagctcc 


atgcacatag 


1140 


cccagcaaag 


agcaacacag 


agctgaaagg 


aagactcaga 


ggagagagat 


aagtaaggaa 


1200 


agtagtgatg 


gctctcatcc 


cagacttggc 


catggaaacc 


tggcttctcc 


tggctgtcag 


1260 


cctggtgctc 


ctctatctgt 


gagtaactgt 


tcaggctcct 


cttctctgtt 


tcttggactt 


1320 


ggggtcgtaa 


tcaggcctct 


ctttt 
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<210> 2 

<211> 19 

<212> DNA 

<213> synthetic 



<400> 2 

acaagggcaa gagagaggc 19 

<210> 3 

<211> 19 

<212> DNA 

<213> synthetic 



<400> 3 

acaagggcag gagagaggc 19 

<210> 4 

<211> 10 

<212> DNA 

<213> syntheti c 
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<400> 4 

agggcaagag 10 

<210> 5 

<211> 10 

<212> DNA 

<213> synthetic 



<400> 5 

agggcaggag 10 

<210> 6 

<211> 2438 

<212> DNA 

<213> homo sapiens 



<400> 6 
gatctcagat 


atcccttcta 


tctacacatt 


atctataatt 


ctttctttct 


gtaaactgaa 


60 


aggtcctaga 


aggagccgca 


gctcagcagg 


agagaggagg 


agctgagctg 


ggacccctac 


120 


ctcctgagga 


atgaaatgat 


tattataaag 


acagcaaccg 


agcttatttt 


acccaaaata 


180 


aggtagtata 


tttctgttag 


agtttagagt 


ttcatgagtc 


agggaccaag 


ttattgcttt 


240 


tctttgccct 


gtataaaggc 


ttctccaagg 


cctttgactt 


acctaagtac 


taaatgttat 


300 


aaaaccaaac 


tcttctgacc 


tctcaatcta 


gtcaactggg 


gctgtaatta ttaatgaaat 


360 


taatgtttat 


tttgaaaata 


atttactaga 


ctgaattacg 


aaatcctgaa 


tcattgtaca 


420 


ctatcagtaa 


atattggtgg 


acccaactga 


actgaatgtt 


ttgcttgaaa 


tgaaaccttt 


480 


gagatgcagg 


gcttatgggt 


tctagtccca 


gctctagcac 


tagcagacag 


catgttcttg 


540 


gctaagatac 


tgaatcttca 


aggctcagct 


tcctcattcc 


ggaaatgggt 


caattttatt 


600 


gtaagcagag 


gtaattgaga 


gattcaaaag 


ggacatgagg 


tgtaacaatt 


ctctgtaaat 


660 


tgttagaatc 


cctgttaaaa 


atgaccagta aagctttgtg 


caactgtgtc 


ttgacataac 


720 


tttatttttc 


ttaataaaag 


aaatggaaat 


aacctcacta 


gggaatttag 


aacaaatatg 


780 


atgatatctt 


taaagaaaat 


ggctttgcac 


aagtattgac 


attaatgatc 


tagtaaagtg 


840 


tatctttcta 


gttgtattta 


gatcctcaac tcagtatgtc 


agctcctgtt 


aaggtctata 


900 


cattgtggtg 


gttctgtgct 


gtgggtccat ttagtgattt 


ccctacctcc 


catcttytat 


960 


tgcatccaca 


actgtggttc 


tgtccataat ttcctttgct 


ttctgtgcat tattacatca 


1020 
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tatctgaaaa 


tgagaaacca 


aaaacaatrg 


aaagcagcca 


tgtctggagg 


tgactggggg 


1080 


gtcgagaagc 


cctagtttct 


caaaccctta 


gcaccaaatt 


tttccctcag 


ttacactgag 


1140 


cgtttcactt 


ctgcagtgat 


ggaraaggga 


gatcccttat 


ttcttctcat 


gagcatctct 


1200 


ggtgctgttt 


cccttagaga 


caaataaggg 


gttctattta 


atgtgaagcc 


tgttttatga 


1260 


acagaataaa 


tgtggtgtat 


attcagaata 


actaatgttt 


ggaagttgtt 


ttatttttgc 


1320 


taaaaattgt 


tctcaaggca 


gctctggtgt 


aagagataat 


acaccacgat 


gggcatcaga 


1380 


agacctcagc 


tcaaatccca 


gttctgccag 


ctatgagctg 


tgtggcacca 


acaggtgtcc 


1440 


tgttctccca 


gggtctccct 


tttcccattt 


gaaaaataaa 


aaataacaat 


tcctgccttc 


1500 


aggaattttt 


tttagggggt 


ttaatkgtaa 


aggtgtttat 


atctgctaag 


gtaatttact 


1560 


tgatatatgt 


ttggttattt 


aagatatatg 


agttatgtta 


gctatttcat 


gtttaggctg 


1620 


ctgtattttt 


agtaggctat 


attaaatatt 


tgaaaggatt 


wmattataaa 


gaacaaagtc 


1680 


tcctaatctt 


tgatatagca 


ttgacatact 


ttttaaatat 


acaaggcata 


gaatatggcc 


1740 


atttctgtta 


aatcatatat 


tcccaactgg 


ttattaatct 


aagaattcag 


aattttgagt 


1800 


aattgctttt 


gcatcagatt 


atttacttca 


gtgctctcaa 


ttatgatggt 


gcattagaac 


1860 


catctgggtt 


aacatttgtt 


ttttattacc 


aatacctagg 


ctccaaccaa 


gtacagtgaa 


1920 


actggaatgt 


acagagtgga 


caatggaacg 


aaggagaaca 


agaccaaagg 


acattttatt 


1980 


tttatctgta 


tcagtgggtc 


aaagtccttt 


cagaaggagc 


atatagtgga 


cctaggtgat 


2040 


tggtcaattt 


atccatcaaa 


gaggcacaca 


ccgaattagc 


atggagtgtt 


ataaaaggct 


2100 


tggagtgcaa 


gctcatggtt 


gtcttaacaa 


gaagagaagg 


cttcaatgga 


ttctcttgtg 


2160 


gtccttgtgc 


tctgtctctc 


atgtttgctt 


ctcctttcac 


tctggagaca 


gagctctggg 


2220 


agaggaaaac 


tccctcctgg 


ccccactcct 


ctcccagtga 


ttggaaatat 


cctacagata 


2280 


ggtattaagg 


acatcagcaa 


atccttaacc 


aatgtaagta 


tgctccttca gtggcttgca 


2340 


aaaggtaagt 


aaattcacct 


gtatttttta 


aataaagtgt 


atccctagag 


gtacatgtta 


2400 


caagaggtaa 


tggtaaagta 


aaatactttg 


aaaggctt 
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<210> 7 

<211> 20 

<212> DNA 

<213> synthetic 

<400> 7 

ccagcctgaa agtgcagaga 20 
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<210> 8 

<211> 25 

<212> DNA 

<213> synthetic 

<400> 8 
. tcttagagtc tttcctcacc aaact 

<210> 9 

<211> 20 

<212> DNA 

<213> synthetic 

<400> 9 

catgccctgt ctctccttta 

<210> 10 

<211> 19 

<212> DNA 

<213> synthetic 

<400> 10 

ccatcccctt catgcaatc 

<210> 11 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 11 
agcaccctgg t 

<210> 12 

<211> 11 

<212> DNA 

<213> synthetic 
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19 
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<400> 12 
agcacgctgg t 



11 



<210> 13 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 13 

accagggtgc t 11 

<210> 14 

<211> 11 

<212> DNA 

<213> synthetic 



<210> 15 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 15 

gtgtgtacag c 11 

<210> 16 

<211> 11 

<212> DNA 

<213> synthetic 



<400> 14 
accagcgtgc t 



11 



<400> 16 
gctgtacaca c 



11 
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<210> 17 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 17 
tggtccctac c 

<210> 18 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 18 
ggtagggacc a 

<210> 19 

<211> 25 

<212> DNA 

<213> synthetic 

<400> 19 

cactagggaa tttagaacaa atatg 

<210> 20 

<211> 23 

<212> DNA 

<213> synthetic 

<400> 20 

gcacagaaag caaaggaaat tat 

<210> 21 

<211> 27 

<212> DNA 

<213> syntheti c 
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11 



11 



25 



23 
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<400> 21 

tgtatttaga tcctcaactc agtatgt 



27 



<210> 22 

<211> 21 

<212> DNA 

<213> synthetic 

<400> 22 

ggatctccct tctccatcac t 21 

<210> 23 

<211> 23 

<212> DNA 

<213> synthetic 



<210> 24 

<211> 25 

<212> DNA 

<213> synthetic 

<400> 24 

atacaccaca tttattctgt teat a 25 

<210> 25 

<211> 22 

<212> DNA 

<213> synthetic 



<400> 23 

ggtccattta gtgatttccc tac 



23 



<400> 25 

ccaaattttt ccctcagtta ca 



22 
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<210> 26 

<213> 20 

<212> DNA 

<213> synthetic 

<400> 26 

ttggtgccac acagctcata 

<210> 27 

<211> 20 

<212> DNA 

<213> synthetic 

<400> 27 

gccttcagga atttttttta 

<210> 28 

<211> 25 

<212> DNA 

<213> synthetic 

<400> 28 

ccagttggga atatatgatt taaca 

<210> 29 

<211> 25 

<212> DNA 

<213> synthetic 

<400> 29 

gctgctgtat ttttagtagg ctata 

<210> 30 

<211> 22 

<212> DNA 

<213> synthetic 
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20 



25 



25 
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<400> 30 

cgttccattg tccactctgt ac 22 

<210> 31 

<211> 20 

<212> DNA 

<213> synthetic 

<400> 31 

tcaaggcagc tctggtgtaa 20 

<210> 32 

<211> 25 

<212> DNA 

<213> synthetic 

<400> 32 

agttgggaat atatgattta acaga 25 

<210> 33 

<211> 11 

<212> DNA 

<213> synthetic 



<400> 33 

atcttctatt g 11 

<210> 34 

<211> 11 

<212> DNA 

<213> synthetic 



<400> 34 

atcttttatt g 11 
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<210> 35 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 35 
acaatagaaa g 

<210> 36 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 36 
acaatggaaa g 

<210> 37 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 37 
atggagaagg g 

<210> 38 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 38 
atggaaaagg g 

<210> 39 

<211> 11 

<212> DNA 

<213> synthetic 



11 



11 



11 



11 



11 
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<400> 39 
ttaatggtaa a 

<210> 40 
<211> 11 
<212> DNA 

<213> synthetic 



<400> 40 
ttaattgtaa a 

<210> 41 

<211> 12 

<212> DNA 

<213> synthetic 



<400> 41 
ggatttcatt at 

<210> 42 

<211> 12 

<212> DNA 

<213> synthetic 

<400> 42 
ggattaaatt at 

<210> 43 

<211> 11 

<212> DNA 

<213> syntheti c 



<400> 43 
caatagaaga t 



11 



11 



12 



12 



11 



12 
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<210> 44 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 44 
caataaaaga t 

<210> 45 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 45 
ctttctattg t 

<210> 46 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 46 
ctttccattg t 

<210> 47 

<211> 11 

<212> DNA 

<213> syntheti c 

<400> 47 
cccttctcca t 

<210> 48 

<211> 11 

<212> DNA 

<213> synthetic 



11 



11 



11 



11 
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<400> 48 
cccttttcca t 



11 



<210> 49 

<211> 11 

<212> DNA 

<213> syntheti c 

<400> 49 

tttaccatta a 11 

<210> 50 

<211> 11 

<212> DNA 

<213> synthetic 



<210> 51 

<211> 12 

<212> DNA 

<213> synthetic 

<400> 51 

ataatgaaat cc 12 

<210> 52 

<211> 12 

<212> DNA 

<213> syntheti c 



<400> 50 
tttacaatta a 



11 



<400> 52 
ataatttaat cc 



12 
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<210> 53 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 53 
tacctcccat c 

<210> 54 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 54 
aaccaaaaac a 

<210> 55 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 55 
ctgcagtgat g 

<210> 56 

<211> 11 

<212> DNA 

<213> syntheti c 

<400> 56 
tagggggttt a 

<210> 57 

<211> 11 

<212> DNA 

<213> synthetic 



11 



11 



11 



11 
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<400> 57 
atttgaaagg a 

<210> 58 

<211> 11 

<212> DNA 

<213> synthetic 



<400> 58 
gatgggaggt a 

<210> 59 

<211> 11 

<212> DNA 

<213> synthetic 



<400> 59 
tgtttttggt t 

<210> 60 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 60 
catcactgca g 

<210> 61 

<211> 11 

<212> DNA 

<213> synthetic 



<400> 61 
taaaccccct a 



11 



11 



11 



11 



11 
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<210> 62 

<211> 11 

<212> DNA 

<213> syntheti c 

<400> 62 
tcctttcaaa t 

<210> 63 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 63 
tgtggatgca a 

<210> 64 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 64 
catggctgct t 

<210> 65 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 65 
agggatctcc c 

<210> 66 

<211> 11 

<212> DNA 

<213> synthetic 



11 



11 



11 



11 
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<400> 66 
taaacacctt t 



11 



<210> 67 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 67 

tgttctttat a 11 

<210> 68 

<211> 11 

<212> DNA 

<213> synthetic 



<210> 69 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 69 

aagcagccat g 11 

<210> 70 

<211> 11 

<212> DNA 

<213> synthetic 



<400> 68 
ttgcatccac a 



11 



<400> 70 
gggagatccc t 



11 
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<210> 71 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 71 
aaaggtgttt a 

<210> 72 

<211> 11 

<212> DNA 

<213> synthetic 

<400> 72 
tataaagaac a 
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